On coresets for support vector machines
Abstract
We present an efficient coreset construction algorithm for large-scale Support Vector Machine (SVM) training in Big Data and streaming applications. A coreset is a small, representative subset of the original data points such that a model trained on the coreset is provably competitive with one trained on the original data set. Since the size of the coreset is generally much smaller than the original set, our preprocess-then-train scheme has the potential to lead to significant speedups when training SVM models. We prove lower and upper bounds on the size of the coreset required to obtain small data summaries for the SVM problem. As a corollary, we show that our algorithm can be used to extend the applicability of any off-the-shelf SVM solver to streaming, distributed, and dynamic data settings. We evaluate the performance of our algorithm on real-world and synthetic data sets. Our experimental results reaffirm the favorable theoretical properties of our algorithm and demonstrate its practical effectiveness in accelerating SVM training.
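The idea of a weighted summary in the abstract can be illustrated with a minimal sketch. It is not the paper's construction: it uses plain uniform sampling with weights n/m as a stand-in for the importance-based selection a coreset algorithm would use, and it assumes a standard soft-margin objective. All names (`svm_objective`, `uniform_weighted_sample`, `lam`) and the synthetic data are hypothetical and chosen only for illustration. The sketch checks how closely the weighted objective on the subset tracks the objective on the full set for one fixed query (w, b).

```python
import random

def svm_objective(points, weights, w, b, lam=1.0):
    """Weighted soft-margin objective: 0.5*||w||^2 + lam * sum_i u_i * hinge_i.

    Assumed form for illustration; the paper's exact objective may differ.
    """
    reg = 0.5 * sum(wj * wj for wj in w)
    hinge = 0.0
    for (x, y), u in zip(points, weights):
        margin = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
        hinge += u * max(0.0, 1.0 - margin)
    return reg + lam * hinge

def uniform_weighted_sample(points, m, rng):
    """Draw m points uniformly with replacement and weight each by n/m.

    This is a baseline stand-in, NOT the sensitivity-based sampling
    that a coreset construction would use.
    """
    n = len(points)
    sample = [points[rng.randrange(n)] for _ in range(m)]
    return sample, [n / m] * m

rng = random.Random(0)

# Synthetic 2-D data: two Gaussian blobs centered at (1, 1) and (-1, -1).
data = []
for _ in range(20000):
    y = rng.choice((-1, 1))
    data.append(((rng.gauss(y, 1.0), rng.gauss(y, 1.0)), y))

full_weights = [1.0] * len(data)
subset, subset_weights = uniform_weighted_sample(data, 4000, rng)

# One fixed query hyperplane; a coreset guarantee would hold for all queries.
w, b = (0.7, 0.7), 0.0
full_cost = svm_objective(data, full_weights, w, b)
subset_cost = svm_objective(subset, subset_weights, w, b)
rel_err = abs(full_cost - subset_cost) / full_cost
print(f"relative error of the weighted subset objective: {rel_err:.4f}")
```

In a preprocess-then-train pipeline, the weighted subset would then be passed to any solver that accepts per-sample weights in place of the full data set. Uniform sampling gives no worst-case guarantee over all queries, which is the gap that a provable coreset construction is meant to close.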
Similar resources
Training Support Vector Machines using Coresets
Note: This work was done as a course project as part of an ongoing research effort that was recently submitted [2]. The submission, done in collaboration with Murad Tukan, Dan Feldman, and Daniela Rus [2], supersedes the work in this manuscript. We present a novel coreset construction algorithm for solving classification tasks using Support Vector Machines (SVMs) in a computationally efficient ...
Stage-Discharge Modeling Using Support Vector Machines
Hydrologists often need to establish rating curves to estimate flow in streams, rivers, etc. Measuring discharge in a river is a time-consuming, expensive, and difficult process, and the conventional approach of regression analysis of the stage-discharge relation does not provide encouraging results, especially during floods. P
On Transductive Support Vector Machines
Transductive support vector machines (TSVMs) have been widely used as a means of treating partially labeled data in semisupervised learning. They have remained somewhat mysterious because the foundations of their generalization behavior are poorly understood. This article aims to clarify several controversial aspects regarding TSVMs. Two main results are established. First, TSVM performs no worse than its supervise...
On Universum-Support Vector Machines
Universum-support vector machine (U-SVM) is an elegant method for the 2-class classification problem. It is systematically studied in this paper, including the existence and uniqueness of the solution of the primal problem as well as the relation between the solutions of the primal problem and the dual problem. We find that U-SVM uses a 3-class classification approach to solve the 2-class classification problem. So we have ...
On Margin and Support Vector Separability in Support Vector Machines for Regression
In this report we show some simple properties of SVM for regression. In particular, we show that for ε close to zero, minimizing the norm of w is equivalent to maximizing the distance between the optimal approximating hyperplane solution of SVMR and the closest points in the data set. So, in this case, there exists a complete analogy between SVM for regression and classification, and the ε-tube plays...
Journal
Journal title: Theoretical Computer Science
Year: 2021
ISSN: 1879-2294, 0304-3975
DOI: https://doi.org/10.1016/j.tcs.2021.09.008